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Abstract 

Samples of bright stars, as they emerge from surveys such as RAVE, contain com- 
parable fractions of dwarf and giant stars. An efficient separation of these two lu- 
minosity classes is therefore important, especially for studies in which distances are 
estimated through photometric parallax relations. We use the available spectroscopic 
log g estimates from the second RAVE data release (DR2) to assign each star a prob- 
ability for being a dwarf or subgiant/ giant based on mixture model fits to the log g 
distribution in different color bins. We further attempt to use these stars as a labeled 
training set in order to classify stars which lack log g estimates into dwarfs and giants 
with a SVM algorithm. We assess the performance of this classification against dif- 
ferent choices of the input feature vector. In particular, we use different combinations 
of reduced proper motions, 2MASS JHK, DENIS UK and USNO-B B2R2 apparent 
magnitudes. Our study shows that - for our color ranges - the infrared bands alone 
provide no relevant information to separate dwarfs and giants. Even when optical 
bands and reduced proper motions are added, the fraction of true giants classified as 
dwarfs (the contamination) remains above 20%. 

Using only the dwarfs with available spectroscopic logg and distance estimates 
(the latter from Breddels et al. 2010), we then repeat the stream search by Klement, 
Fuchs & Rix (2008, KFR08), which assumed all stars were dwarfs and claimed the 
discovery of a new stellar stream at V « -160 km s" 1 in a sample of 7015 stars 
from RAVE DR1. The existence of the KFR08 stream has been supported by two 
recent studies using other independent datasets. Our re-analysis of the pure DR2 dwarf 
sample exhibits an overdensity of 5 stars at the phase-space position of the KFR08 
stream, with a metallicity distribution that appears inconsistent with that of stars at 
comparably low rotational velocities. Compared to several smooth Milky Way models, 
the mean standardized deviation of the KFR08 stream is only marginal at 1.6 + 0.4. 
Our data therefore do not allow to draw definite conclusions about its existence, but 
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future RAVE data releases and other (best spectroscopic) surveys are going to help in 
resolving this issue. 

Subject headings: Galaxy: solar neighborhood — stars: late-type — stars: statistics 
— methods: statistical 



1. Introduction 



In the last decade we have witnessed a tremendous increase of astronomical data due to 



large automatic sur veys such a s 2MASS (ISkrutskie et al. 120061) . SDSS/SEGUE dYork et al. Il200d : 
Yanny et al. Il2009|) or RAVE (ISteinmetz et al. II2006Q . The data produced by such surveys - ap- 
parent magnitudes and/or spectra - need further processing to determine the type and properties 
of the observed sources. This step often requires automatic algorithms that search for regularities 
in the data and use this information to classify the data into different categories (classification) 
or to determine the values of certain parameters (regression). Machine learning techniques are 
ideally s uited for such purposes , and their applications have dramatically grown in astronomy. For 
example. iRichards et al. I (120041) used the location of 16,713 spectroscopically classified quasars in 
SDSS color-space to identify 100,565 quasar candidates in a sample of SDSS DR1 point sources 
based on kernel density estimation. Similarly, using neural networks trained on sp ectroscopically 



pre-c lassified high-redshift SDSS quasars with optical photometry and radio data, ICarballo et al. 



(|2008[) obtained 58 high-redshift quasar candidates in data taken from the FIRST survey; spec- 
troscopic follow-up observations confir med that 60% of the c andid ates were true high-redshift 
quasars. A study from the same year by iGao. Zhang & Zhao I (|2008|) compared the performance 
of Support Vector Machines and ^-dimensional trees in the classification of quasars using different 
combinations of SDSS and 2MASS colors and magnitudes. They concluded that both algorithms 
yield high efficiency; SVMs have slightly higher accur acy but need also more co mputation time. 
An example of automated regression can be found in iRe Fiorentin et al. I (I2007|) : these authors 
used non-linear regression models trained either on pre-classified observed SDSS/SEGUE spectra 
or synthetic spectra to estimate the stellar atmospheric parameters r e ff, logg and [Fe/H]. 

The mai n goal of the presen t study is to identify dwarfs in the second RAVE public data 
release (DR2, IZwitter et al. 1 12008|) in order to build a sample with minimal contamination from 



subgiants and giants. Only for such a sample, reliable distance estimates are possible, either from 
photometric parallaxes (not used i n this paper) or the re cently published RAVE distances which 



data release (DRl. lSteinmetz et al 



were derived fro m stellar models (Breddels et al. 1120101 her eafter B+10). DR2 includes th e first 



20061) . which was used by lKlement. Fuchs & Rix I ((2008, here 



after KFR08) to claim the discovery of a new stellar stream in the solar neighborhood. Their anal- 
ysis was based on the assumption that the vast majority of RAVE stars are main sequence stars; as 
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we describe in the next section, this assu mption is questionable and deserves deeper investigation. 



We therefore try to redo the analysis from lKFR08l on a preferably large and pure dwarf sample and 
hope to recover a signal of the new stream. 

The reason that DR1, and also part of DR2, contains no estimate for the stellar parameter 
logg, which would allow a discrimination between dwarf and giant, is that the corresponding 
second-order spectra are contaminated by blue light, because a later installed blue-blocking filter 
(Schott OG513) was not used during the first year. However, part of DR2 contains estimates for 
logg, and we use these to build a training set of stars for automatic classification of the others 
based on features like photometry and proper motions. 

This paper is organized as follows: In §[2l we show that early and late-type stars with available 
logg estimates naturally fall into two or three distinct classes, respectively, which we naively 
interpret as dwarfs, subgiants and giants. We can thus assign class labels to these stars and use them 
to investigate whether it is possible to classify stars into dwarfs and "others" based on photometry 
and/or proper motions alone. This is the subject of § [3j where we show that such a classification, 
performed by a Support Vector Machine (SVM) algorithm, can not yield an output dwarf sample 
in which the contamination from giants is below ~ 20%. In § |4l we therefore use only the dwarfs 
that have b een labelled on the basis of their spectroscopic lo g,? estim ates together with distances 



from lB+lOl to search for stellar streams with our method from KFR08. We show that we are able to 
recover a signal of the proposed new stream, which is however not highly significant. Our results 
are summerized in § [5] 



2. The Mixture Model Classification 



2.1. The Data 



Based on photometric parallaxes calibrated with Hipparcos data, in IKFR081 we searched for 
overdensities in the phase space of nearby (d < 500 pc) stars drawn from DR1 and detected a 
feature located at an azimuthal velocity of V « -160 km s _1 with respect to the Local Standard 
of Rest. This feature contained 5 stars in a region of phase space that proved to be significantly 
clumped compared to a smooth Milky Way model (cr > 2), while another 14 stars just outside this 
region probably also belonged to this stellar "stream". This was based on the assumption that "the 
vast majority of RAVE stars are main sequence stars, because giants in the RAVE magnitude range 
would lie so far away, that the star density is substantially decreased." 



However. ISeabroke et al. I (|2008) have estimated that only ~ 5% of the RAVE stars are K-M 
dwarfs, while K-M giants account for ~ 44%. We therefore wish to critically check our assumption 
by using the stars from DR2 with spectroscopic log g estimates to build a new sample of dwarfs 
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with minimal contamination from the other luminosity classes. The logg values were derived 
by a penalized y 1 m ethod to construct a synthetic spectrum that matches the observed spectrum 
(|Zwitter et al. Il2008[) . The observed spectrum is modelled as a weighted sum of Kurucz template 
spectra with known parameters and it is assumed that the parameters follow the same weight rela- 
tion. From a comparison to external datasets, the typical logg uncertainties for stars with a typical 
S/N ratio of 40 range from ~ 0. 15 dex fo r dwarfs with T c s = 9000 K to ~ 0.7 dex for cooler dwarfs 
with T eS = 6000 K dzwitter et al. II2008L Fig. 19). 



Recently, |B+10j published distance estimates for 16,075 stars in the DR2 catalog with typical 
relative uncertainties in the range 30%-50%. These distances were derived by fitting the observed 
log g, T e ff , [M/H] an d (J - K) values of ea ch star to one out of 600 theoretical isochrones from 
the Y 2 (Yonsei- Yale. [Demarque et al. Il2004|) models, in this way obtaining absolute magnitudes in 
the J band. Uncertainties on the distances have been computed out of the probability distribution 
function which is obtained by drawing a sample of 5000 Monte-Carlo stars from a 4D Gaussian 
centered on the model isochrone with standard deviation in each parameter equal to the uncertainty 
of the corresponding observed parameter of the true star, and again finding the best-matching 
isochrone for each of the 5000 Monte Carlo stars. By comparing their distances to those of main 
sequence stars from the Hipparcos catalog and giants in the open cluster M67, iB+lOl concluded 
that their distances for the main sequence stars are reliable, with no systematic errors exceeding 
~ 10%, whereas the giants were systematically offset from the expected M67 isochrone. This 
again highlights our wish to build a pure sample of dwarfs based on the available log g estimates. 



2.2. Creating a labeled sample of Dwarfs and Giants based on mixture models 



Before we start investigating the log g distribution, we take the following considerations into 
account: first, the gravity parameter of a given luminosity class changes with spectral type (color). 
For example, log g is expected to be lower for G dwarfs than for M dwarfs (of the same metallicity). 
Second, reddening of stars leads to a less clear separation of dwarfs from other luminosity classes 
in the log g distribution, because a reddened giant will have a higher log g value than an unreddened 
one with the same color, while for dwarfs the opposite is true (although in the RAVE magnitude 
range reddening will primarily effect the more distant giants). Third, the dwarf-giant separation 
will get better with increasing color, because the number of turn-off s tars and subgiants decreases. 
Fourth, the expected small fraction of dwarfs (ISeabroke et al. 1 12008|) in combination with their 
higher log g uncertainties might hinde r a clear separation. Finally, the log g distribution has been 
shown to vary with galactic latitude b (lZwitter et al. II2008L Fig. 22). 



Keeping these considerations in mind, we prepare the DR2 sample for our purposes in the 
following way: of the 51,829 objects contained in DR2, only 49,327 objects are unique, the rest 



-5- 



being repeated observations of single stars (|Zwitter et al. II2008L Table 1). If a star has been ob- 
served multiple times, we assign it the mean radial velocity of these observations. Next, we remove 
all stars without photometry in all of the 2MASS JHK bands, because these bands are the most 
comprehensive ones and least affected by extinction. We further decrease any redding effects by 
keeping only stars with latitudes b > 20°. This step has the additional bene fit that the number of 
giants goes down due to their decreasing space density in the halo (see also IZwitter et al. II2008L 
Fig. 23). Our cuts leave 48,335 individual stars of which a subset of 20,080 have estimates for 
log g. To account for the varying fraction of dwarfs and subgiants/giants with galactic latitude, we 
derive logg distributions separately for stars with 20° < \b\ < 40° and 40° < \b\ < 90°, respectively. 

These are displayed in Figures \T\ and [2l where we have separated stars with (J - K) < 0.5 (the 
"early-type" subsample, upper left panel) from those with (J-K) > 0.5 (the "late type" subsample, 
which is further bin ned into the other panels). According to synthetic 2MASS colors calculated by 
Covey et al. I (|2007l Table 3), a solar-metallicity K0 star has (J-K) = 0.54. Hence the color cut at 
(J-K) = 0.5 roughly separates K and M stars from earlier types. From Figures [T] and |2] it is also 
apparent that it divides our DR2 sample into two parts with respect to the fraction of giants: in the 
early type subsample, the contamination from giants seems to be negligible; the log ^-distribution 
can be fitted by a single Gaussian with mean /u = 4.2 dex and standard deviation a = 0.44 dex, 
independent of latitude b. In this subsample, the fraction of stars that lie outside of the 2<x-region 
of the Gaussian (at logg < 3.09, which we decide to attribute to giants) is less than 3.9%. The 
Gaussian most likely contains dwarfs as well as turnoff-stars and subgiants, because the transitions 
in this color range are continuous. In the late-type subsample, on the other hand, we clearly have 
more than one pop ulation of stars, and the logg distribut ions change with colorlll We use an EM- 
algorithm (see e.g. lHastie. Tibshirani & Friedman II2008L Section 8.5) to fit the log ^-distributions 
with mixture models of multiple Gaussians, in this way obtaining probability densities without 
using class labels (unsupervised classification). We found that in general, three Gaussians fit the 
observed distributions well, while adding more components can result in overfitting of the data@ 



The log g distribution in the early type subsample changes much less prominent with color, so that we marginalized 
over colors. 

2 The fit to the high-latitude stars in the color range 0.8 < J - K < 0.9 is not so easy to interpret, because we found 
that the solution of the EM algorithm in this color range is not unique: if we choose initial mean values of log g > 3 
for two of the Gaussians, the algorithm converges to a different solution, which fits the distribution at log g > 3 with 
two narrow Gaussians caused by the "gap" at \ogg ~ 4.2. However, stars in this color range should not have yet 
developed into subgiants, so that two narrow Gaussians are not only statistically, but also physically unrealistic; a 
better explanation for the gap would be the large uncertainties in the logg values for red stars. The lack of subgiants 
should allow a two-component fit and indeed we find that a two-component fit is almost indistinguishable from the 
three-component one (black dotted line in Fig.|2|. We nevertheless adopt the three-component fit to be consistent with 
the other color bins, knowing that the separation of dwarfs will be very similar. 
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This shows that we can generally decompose stars in the different color bins into three classes, 
which we can think of being dwarfs, subgiants/intermediates and giants. An exeption is the color 
range (7 - K) > 0.9, where we do not fit the data, because in this color range the signature of 
the dwarf population is washed out due to their small number and large logg un certainties. We 



therefore do not use stars with (J - K) > 0.9 (M4/M5 and later. Icovey et al. 2007 ) in our further 



analysis, also because the se stars fall out of t he range of the synthetic spectra which were used to 



obtain stellar parameters (|Zwitter et al. II2008L Sect. 3.2.). 



The parameters of the different mixture models are given in Tabled] As expected, the weight- 
ing of the median Gaussian component associated with the contribution from subgiants decreases 
with increasing color, from 0.57 at 0.5 < 7 - K < 0.6 to 0.06 at 0.8 < 7 - K < 0.9 for the low 
latitude sources. Also, the fraction of the third component associated with dwarfs is higher at high 
galactic latitudes for all color bins except 0.5 < J — K < 0.6, w here it is roughly constant at ~ 30%. 



This trend has already been described by [Zwitter et al. I (120081) and is attributed to the fact that the 



RAVE giants trace a systematically more distant population and already belong to the scarcely 
populated halo at high latitudes. There is a clear trend of the means of all three components to shift 
to lower values with increasing color, e.g. from log g = 4.4 at 0.5 < J - K < 0.6 to log g = 4.2 at 
0.8 < J - K < 0.9 for the dwarfs, reflecing the decreasing stellar masses of giants, subgiants and 
dwarfs for later types. This is accompanied by a trend of the standard deviations to increase, which 
is caused by the fact that the log g uncertainties increase with color. The color bin 0.8 < J—K < 0.9 
allows the clearest separation between dwarfs and giants due to the very few subgiants, expressed 
through a relatively large standard deviation of the dwarfish Gaussian component. 

From Figures Q] and [2] and Table \T\ we see that the three-component mixture models fit the 
observed log ^-distributions reasonably well and - as a result - predict the late-type dwarfs to be 
located around logg « 4.4. Adopting this model, we calculate a star's probability for being a dwarf 
as 



m ni ,/ m,n tt 3 (7 - K)N(x i \ » 3 (J - K, \b\), cr,{J - K, \b\)) 

jp(dwarf logg, (7 - K), \b\) = — - , (1) 

iLi *k(J ~ K, \b\)N(x, | /u k (J - K, \b\), cr k (J - K, \b\)) 



where the index 3 refers to the Gaussian component on the right side of the log g-distribution, that 
we choose to interpret as representing the dwarfs, and the parameters depend on the color (7 - K) 
and the absolute value of latitude \b\ given in Table [Q 

In what follows, we simply combine subgiants, giants and supergiants into a single class 
"giants", because we are only interested in separating the late-type dwarfs from these other lumi- 
nosity classes. We consider a star as a dwarf if it has tp(dwarf| logg) > 0.5, and likewise a giant if 
fp(dwarf| logg) < 0.5; the thresholds are tabulated in Table [T] In this way, the late type subsample 
(0.5 <(J-K)< 0.9) comprises 1752 "dwarfs" (20.3%) and 6882 "giants". 



Table 1 . Parameters of the mixture models 



(J-K) I 71 n <T \oggt 

mag (dex) (dex) (dex) 



0.5. 


.0.6 


(0.12,0.57,0.31) 


(2.25,3.40,4.38) 


(0.38,0.45,0.30) 


4.02 


(0.31,0.41,0.28) 


(2.72,3.61,4.43) 


(0.54,0.41,0.25) 


4.11 


0.6. 


.0.7 


(0.39,0.55,0.06) 


(2.17,2.82,4.43) 


(0.27,0.64,0.27) 


4.10 


(0.25,0.59,0.16) 


(2.18,2.79,4.48) 


(0.26,0.68,0.24) 


4.07 


0.7. 


.0.8 


(0.57,0.34,0.09) 


(2.10,2.51,4.46) 


(0.34,0.86,0.33) 


3.98 


(0.44,0.31,0.25) 


(1.97,2.54,4.45) 


(0.43,0.97,0.26) 


3.92 


0.8. 


.0.9 


(0.73,0.06,0.21) 


(1.54,2.66,4.20) 


(0.46,0.73,0.38) 


3.50 


(0.45,0.08,0.47) 


(1.41,2.89,4.18) 


(0.42,0.76,0.45) 


3.21 



Note. — In each row, the upper part corresponds to the stars with 20° < \b\ < 
40°, the lower part to those with 40° < \b\ < 90°. The columns give the valid 
color range for the mixture model, the vector of mixing coefficients n, the vectors 
of means fi and standard deviations cr of the Gaussian components, and log g t , 
the threshold value where the probability of a star for being a dwarf gets greater 
than 0.5. The latter is computed by assuming that the dwarfs are described by 
the third Gaussian at the right of the distribution. 
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J-Kg0.5 0.5<J-Kg0.6 0.6<J-Kg0.7 




og g log g log g 



Fig. 1. — logg histograms for stars at galactic latitudes 20° < \b\ < 40° and with (J - K) < 0.5 
{upper left panel) and five bins of stars with (7 - K) > 0.5 (other panels). The curves show 
mixture model fits to what we consider a dwarf and a "giant" population. For the early type stars, 
the distribution can be fitted by a single Gaussian, while for the late type stars three components 
are fit (red curve). The blue dashed curve gives a fit with four components and looks almost 
indistinguishable to the three-component fit. The logg distributions do not extend beyond 5.0 dex, 
which is the grid limit of the spectral libraries for RAVE. 



3. The SVM Classification 



In the previous section, we have classified the late-type stars having spectroscopic logg esti- 
mates with the help of an unsupervised mixture model classification. In this section, our goal is 
to use these stars to investigate whether it is possible to classify the ones that lack logg estimates 
into dwarfs and giants based on some other features by a supervised machine-learning algorithm. 
Before we decide on which features to use, we face the problem of selecting an appropriate train- 
ing set of the two classes. One idea would be to cut the logg-distribution at, e.g, logg = 3.5 and 
4.0, respectively, to distinguish d warfs from giants. A lthough such a choice would account for the 
mean log g uncertainty of 0.5 dex (|Zwitter et al. Il2008|) in some sense and create two class samples 
that could be considered as pure, it assumes totally correct log g values and can not account for the 
logg uncertainty of the individual stars. In addition, the exact choice of the cutoffs is somewhat 
arbitrary and not directly obvious from the logg-distributions (Figured)). Finally, by leaving out a 
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J-K^O.5 



0.5<J-K^0.6 



0.6<J-Kg0.7 

N=1469 




Fig. 2. — Same as Figured] but now for stars with 40° < \b\ < 90°. For the color bin 0.8 < J - K < 
0.9 we additionally show a two-component mixture model fit (black dotted line) since subgiants 
should be negligible in this color range. 



"gap region" in the log ^-distribution one looses information from the stars that might be scattered 
into this region due to their logg uncertainties. 

A solution to this problem is to work in a fully probabilistic framework. Although for the 
classification of stars with unknown logg estimates we still need to assign a "true" class to each 
star we put into our training set, we could still weight them during the classification according to 
their logg values and hence their probability p that they truly belong to their assigned class. 

We therefore assign a weight w to each star in a given class by mapping its probability for that 
class onto the interval [0, 1]£] 

( 2 (^(dwarfj log g) - 0.5) if p(dwarf| log g) > 0.5 
w = { (2) 
[2 (1 - p(dwarf| logg) - 0.5) ifp(dwarf| logg) < 0.5. 

For the machine learning algorithm we choose a SVM, because it is highly adaptable and 



Although in principal weights could range from to oo, we choose this scaling because it puts sufficient weight 
onto stars with higher class probabilities and makes conversions between p and w easier to interpret. 
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capable of dealing with a multidimensional feature space. The SVM identifies a decision boundary 
in a multidimensional space based on a training set containing examples of two or more classes and 
assigns class probabilities to each object acco rding to its distance from the decision boundary. We 
use a Java-coded implementation of LIBS VM qChang & Lin which is available onlinen. We 

adopted the tool "Weights for data instances" written by Ming-Wei Chang and Hsuan-Tien Lin into 
our Java code@ This tool allows the user to weight the input data by creating an individual penalty 
parameter C for each data point. In our case, we use the weights w obtained from equation © 
to weight each dwarf and giant in the training set. The training set itself is constructed from the 
subset of late-type stars with available logg estimates - the 1752 "dwarfs" and the 6882 "giants" 
with 0.5 <(J-K)< 0.9. 



3.1. Choosing Appropriate Features for the Dwarf-Giant Classification 

We now explore which features are generally suited to distinguish dwarfs from giants. One 
feature that has often been used to separate dwarfs from giants is the reduced proper motion. 
Figure [3] shows the (7 - K) reduced proper motion diagram for all dwarfs (black dots) and giants 



(red d ots) in the combined early- and late-type subsamples (compare to Fig. 13 of ISeabroke et al. 



2008). We do not show stars with no given or improbably high (>1000 mas/yr) proper motions. 
This leaves 1743 dwarfs and 6877 giants in the late-type subsample and 10756 dwarfs and 208 
giants in the early-type subsample, respectively. In the early-type subsample, we consider a star 
as a giant if it lies outside the 2<x region of the Gaussian curve in the upper left panel of Figured] 
Note that we make no distinction between dwarfs, turnoff-stars and subgiants in the early-type 
subsample and include these stars in Figure [3] only for illustration, because we not use them in the 
SVM classification. The reduced proper motion, H K , is 

H K = 5 + 5 log// + K = 5 log v x + M K - 3.379, (3) 

where /i is the total proper motion (in arcsec/year), v ± the total transvers e space velocity (in km 



s l ) and M K the absolute magnitude in the ^-band. Like in Figure 13 of Seabro ke et al. I (|2008), 



we show the solar-scaled Padova 2MASS isochrone (|Bonatto. Bica & Girardi Il2004|) as a repre 



sentation of a thin disk population (Z=0.019, 2.5 Gyr, v± = 60 km s 

Assuming that the transverse velocities of dwarfs and giants of the same spectral type are 
equally distributed, the reduced proper motion is a proxy for the absolute magnitude. In Figure [31 



4 http : / / www .csie.ntu.edu.tw/~cjlin/ libsvm/ 

5 http : / / www .csie.ntu.edu.tw/~cjlin/ libsvmtools/ #15 



The v± = 30 km s in ISeabroke et al. I d2008l) is a typographical error (Seabroke, private communication). 
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Fig. 3. — Reduced proper motion diagram for stars in the color range < (j - K) < 0.9 that have 
been classified based on their spectroscopically derived lo g # values and the mixture mode ls shown 
in Figured] The black solid line is a fiducial sequence from lBonatto. Bica & Girardi I (|2004 ) chosen 
to represent a typical thin disc population. The black dotted lines are the approximate separations 
between K-M dwarfs and giants made by ISeabroke et al. I (|2008|) . The black star symbols denote 
members of the putative KFR08 stream; t he black diamonds are members in the most significant 
(<r > 2) region of the stream in Fig. 10 of lKFR08l . Note that four of the putative stream members 
lie in the color range (J - K) > 0.9, where we do not try to classify stars. 



we fur ther show the separation lines between K-M dwarfs and giants adopted by ISeabroke et al 



(I2008|) based on the dwarf-giant bifurcation in their Figure 13. We also overplotted the 19 members 
of the stellar stream that we have found in our previous analysis of DR1 data (the 'KFR08' stream): 
the blue dia monds de note members that lie in the region of phase space where the significance in 
Figure 10 of KFR08 is greater than 2 (at V ~ -160 km s -1 ), while the green stars are the other 
puta tive mem bers that lie in a still overdense region localized by -180 km s" 1 < V < -140 km s" 1 
(see IkFR08L Section 4.3.). The kinematic separation implies that we classify many of the dwarfs 
as giants, most notably in the color range 0.8 < (J - K) < 0.9 (see also Figured]). Also, we now 
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realize that a considerable proportion of the putative stream stars are probably giants, and a more 
detailed analysis reveals that they have been projected into the same part of phase space because 
of their similar large radial velocities. 



It is obvious from Figure [3] and Figure 13 in ISeabroke et al. I (|2008l) that the reduced proper 
motion, H K , might be a way to separate dwarfs and giants in the late-type subsample. However, 
classifying stars in this way introduces a kinematic bias, which is undesirable in most applications. 
A dwarf-giant classifi cation method that is particularly interesting for our study has been described 
by iBilir et al. I (|2006|) . These authors used linear decision boundaries in the space spanned by the 
apparent V and 2MASS magnitudes, (V, J, H, K), to separate FGK dwar fs (selected by logg > 
4) from K giants (logg < 3). As pointed out by iMajewski et al. I (I2003|) . dwarfs and giants are 
degenerate in the infrared colors alone, except for the latest types (J - K > 0.85) where opacity 
effects play a role. We therefore expect to need at least one optical band in order to obtain good 
classification results. 



Motivated by the work of IBilir et al. I (I2006|) . we show in Figure |4] the position of the two 
classes of our late-type subsample in different color-color diagrams. Because only a small fraction 
of stars have available photometry in the Tycho V T band, which is the closest available band to 
V, we choose to use the photographic USNO-B B2 and R2 bands instead]^ In addition, we also 
make use of the DENIS IK J bands. These constraints on available photometry reduce our sample 
to 1305 dwarfs and 5488 giants. The diagrams already suggest that a classification of dwarfs 
and giants based on these colors might be difficult, because the dwarfs are distributed on top of the 
giants and do not occupy separate regions. This highlights the advantage of using a SVM classifier, 
namely that it has non-linear decision boundaries and that the weighting enables us to account for 
uncertainties in the training data. The next section describes the outcome of our classification 
attempts. 



3.2. Evaluating the Classifier Performance 

In this section, we examine the performance of the SVM classifier according to the use of 
different combinations of the following input features: (7, J, K)v Ems , (J, H, K $, (B2,R2) and Hr . 
The choice of these features is motivated by the figures [3] and |4] and the work of IBilir et al. I (|2006|) . 
It leaves us 1305 dwarfs and 5488 giants which we can use for training and testing. Through 



7 The V band (A e ff = 551 nm, FWH M=88 nm) lies b etween B2 and R2, which span the wavelength range of 385-540 
nm and 590-690 nm, respectively (IMonet et al. 1120031) . 

8 7 and K magnitudes without index refer to 2MASS, while for DENIS magnitudes we explicitly add an index. 
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Fig. 4. — Color-color diagrams of 1304 dwarfs (black dots) and 5488 giants (gray dots) in our 
training set. All stars shown occupy the color range 0.5 < (J - K) < 0.9 and therefore are mostly 
spectral types K and M. 



various tests, we found out that feeding the classifier with apparent magnitudes instead of colors 
yields statistically similar results, which shows that it is able to "learn" the right combinations of 
magnitudes by itself. 

Generally, it is not a prior i clear how the imbalanc e in the class distribution will affect a 
classifier. It has been shown by IWeiss & Provost I (|2003r) that the natural distribution not always 
performs best, while a balanced distribution in general performs well§ Although these authors 
used decision trees for two-class classification, whic h are expected to be more sensitive to class 
imbalances than SVMs (IJapkowicz & Stephen 1120 02). it has been recommended that a SVM clas- 



9 "Performing well" in this context means achieving a high completeness and low contamination; see below for the 
definition of completeness and contamination. 
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sifier should be trained on roughly equal n umbers of objects in each class so that it can properly 
learn the class distributions or boundaries ( Bailer- Jones et al. 20081) . We therefore decide to use 
a balanced training set and use the formalism of iBailer- Jones et al. I to adjust the classifier output 
probabilities according to the probabilities we a priori expect for each class. For the latter we adopt 
a simple prior given by the relative fractions of dwarfs and giants that result from our mixture mod- 
els; these are 20.3% dwarfs, 79.7% giants. In this context we emphasize that for dwarfs and giants 
the priors strongly depend on the apparent magnitude limits of the survey. In our case, the stars 
have been drawn fro m the Tycho-2 and S uperCQSMQS catalogs in the Cousins /-band magnitude 
range 9 < / < 12 (|Steinmetz et al. Il200q) . If RAVE reached fainter magnitudes, the fraction of 
giants would decrease, thereby requiring less corrections for class imbalances. 

In the machine learning literature there exist various measures of a classifier's performance 
(e.g. I Weiss & Provost I I2003L Section 4.3). The principle is to split the training set, for which the 
true classes i are known, into a training and a test set and in this way assess the performance of 
the classifier. We can do the classification in two ways: either we assign an object to the (output) 
class j for which its probability is highest (in our case > 0.5); or we set a probability threshold 
P t and assign an object to class j only if its probability for belonging to that class is higher than 
P t . The former approach is best evaluated in the form of a confusion matrix giving true positive, 
false negative, false pos itive and true negative rates for the positive class (dwarfs) (see Tableland 
Weiss & Provost II2003L Section 4.3). The latter approach, however, is more suited for our purpose 
of building a pure dwarf sample, because it allows a more confident classification. Note that the 
higher the threshold, the lower the sample contamination, but the smaller its completeness. The 
completeness measures how many percent of objects in the test set with true class i are contained 
within the output sample for class j = i. The contamination for class j gives the percentage of 
objects in the output sam ple for class j that are truly of other classes (see also equation (1) of 
Bailer- Jones et al. 1120081) . 



We have employed both approaches for different feature vectors x that contain different com- 
binations of the input features. We randomly select roughly half of the dwarfs and approximately 
the same number of giants into a new training set and use the rest of the sample to build a (labeled) 
test set. Although our training set weights each star according to its probability for truly belonging 
to the assigned class, the performance measures do not take into account these weights, only class 
labels. If we put stars into the test set independent of their weights, the classifier performance 
measures are affected by the stars which have small weights (meaning their class assignment is 
uncertain), but are taken to have true known classes. To investigate this effect, we build test sets in 
two ways: in the first case, we only put stars with weights w > 0.8 into the test set, corresponding to 
class probabilities p > 0.90 In the second case, we put no constraints on the weights. We resam- 



°This is equivalent to cutting the log ^-distribution of the late-type stars at \ogg = 3.49 and logg = 2.70, respec- 
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ple the training and testing sets in this way 100 times, each time computing the confusion matrix 
and completeness and contamination in the output dwarf sample for a range of different threshold 
probabilities P t . Finally, we take the mean values and standard deviations of these performance 
measures to compare the classifier performance between different input features. Our experiments 
are summarized in Table [3] in order of decreasing contamination at a threshold P t = 0.7, since 
contamination is the most important measure for the purity of the dwarf output sampled Note 
that the case in which the test sets include stars with small weights can be treated as the worst case 
which gives a lower limit on accuracy and an upper limit on the contamination. 

The outcome of the experiments have similar trends for both cases of test sets. The infrared 
bands alone seem to hold no valuable information about how to draw a decision boundary. This un- 
derlines the argu ment that a dwarf-giant separation from near-infrared bands alone is only possible 



for J - K > 0.85 (|Majewski et al. Il2003|) . Because the SVM is trained by maximizing accuracy, it 
basically classifies every star as a giant. This also explains the very large standard deviation of the 
contamination. Adding the USNO-B B2 and R2 bands (experiments 2 and 3) improves the perfor- 
mance measures in a way that implies that the SVM has learned a decision boundary that separates 
at least a small fraction of dwarfs from the giants. It is likely that a better separation is hindered 
by the relatively large uncertainties of USNO-B apparent magnitudes. The best performance in 
terms of accuracy, completeness and contamination is obtained when 2MASS magnitudes and re- 
duced proper motions are combined, independent of adding other photometric bands. Generally, 
inclusion of reduced proper motions in the feature space yields significantly higher accuracy and 
completeness. However, besides the fact that selecting stars based on reduced proper motions in- 
troduces a kinematic bias, the contamination of the output dwarf sample is still too high for our 
purpose of building a pure dwarf sample. 

We give a qualitative interpretation of the trends described above in Figure \5\ In the upper 
panels we show the trend of completeness and contamination with probability threshold for the 
dwarf sample for one realization of experiments 1, 3 and 6, where in each case we had identical test 
sets with no constraints on the weights (case 2). We see that we can hardly push the contamination 
below ~ 20%, even if we adopt thresholds P t > 0.7. 

The lower panels of Figure[5]show the posterior probabilities for class "dwarf" given the fea- 
ture vector x, i 3 (dwarf|x). In experiment 1 there is only a very weak trend for stars to get higher 
probability for class "dwarf" with increasing log g; in addition, the probabilities stay below 0.5 for 
most of the stars even in the high-log g regime. Adding B2 and R2 to the infrared 2MASS magni- 



tively, to distinguish dwarfs from giants. 



"The choice of P, = 0.7 will be justified later in Figure|5] because at this threshold the drop in contamination starts 
to be more shallow. 
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tudes (experiment 3) improves the classification results in the sense that now the aforementioned 
trend is more pronounced with a non negligible fraction of stars with logg > 4 having probabili- 
ties for class "dwarf" higher than 0.5. Finally, adding reduced proper motions to the feature vector 
transfers even more of the stars with log g > 4 that had P(dwarf \x) < 0.5 in experiments 1 and 3 to 
probabilities higher than 0.5. 

We conclude from our experiments that we can not use the photometric data and/or reduced 
proper motions of our late-type subsample to classify sources in the RAVE catalog that lack log g 
estimates, if we wish to obtain an output sample with contamination below 5-10%. Making use of 
reduced proper motions and choosing a high threshold probability, we could eventually lower the 
contamination to 15-20%, but at the expense of a low and rapidly falling completeness (Figure©. 
This means that, at least for RAVE data, it is a very good and essential thing that future data re- 
leases will contain spectroscopically derived stellar parameters. However, several successful, albeit 
non-probabili stic, dwarf-giant separations based on photometry alone have been described in the 
literature (e.g. becker II 19521 : llVIajewski et al. 112003 : iBilir et al. IboodEfeig Ibood) . We believe that 
the classification of stars without spectra will remain very i mportant for ongoin g and future large- 
scale surveys such as PanStarrs (srizy bands, see Fig. 1 in lStubbs et al. 112007b and LSST (ugrizy 
bands, see Fig. 4 in llvezic et al. Il2008|) . The accurate photometry of these surveys, spanning the 
optical range from ~ 350 nm (w-band) to ~ l//m (y-band), combined with infrared photometry 
from, e.g., 2MASS (~l.2jum-2.2jum) should yield much better results than in our case, where we 
had to rely on a narrow color range and USNO-B photometry derived from photometric plates with 
large uncertainties. The use of machine-learning techniques such as the SVM used by us would 
then allow for fully probabilistic classification with the advantage that additional knowledge (pri- 
ors, other classifiers) could be easily incorporated. 



In the next section we repeat our kinematic stream search from IKFR08L for which a pure 
sample of dwarfs is needed. Due to the unsatisfactory classification results obtained in this section, 
we will only make use of the stars that we classified as dwarfs based on their logg estimates and 
the mixture models in § [2j 



4. Application 

In the previous section we have shown that we can not make use of the RAVE data that 
lack log g estimates, because their classification based on the available photometry and/or reduced 
proper motions is not reliable enough. We therefore decide to join together the "dwarfs" from the 
early and late type subsamples that we classified based on th e mixt ure models in § 12.21 For these 



stars, we expect that we can use the distance estimates from lB+10l with higher confidence than if 



we would simply use all available stars independent of their luminosity class. The reason is that 
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Table 2. Confusion matrix 





dwarf 


giant 


DWARF 


TP 


FN 


GIANT 


FP 


TN 



Note. — The fields TP, 
FN, FP and TN denote the 
true positive, false negative, 
false positive and true nega- 
tive rates for the dwarf sam- 
ple, respectively. 



Table 3. Results of our experiments for two cases of test sets (see text) 



No. 


used features x 


TP 


FN 


FP 


TN 


accuracy 


compo.v 


conto.7 






(%) 


(%) 


(%) 


(%) 


(%) 


(%) 


(%) 








j 4.0 


96.0 


0.5 


99.5 


51.8 ±0.8 


0.7 


31.9 ± 24.4 


(1) 


{J,H,K),(I,J,K) DEms 




\4.0 


96.0 


0.7 


99.3 


51.7 ±0.7 


0.7 


35.6 + 23.1 








15.6 


84.4 


2.3 


97.7 


56.7 ± 1.3 


8.5 


58.0 + 8.1 


(2) 


(I,J,K) DEms ,B2,R2 


1 


k 13.7 


86.3 


2.3 


97.7 


55.7 ± 1.3 


7.1 


52.0 ± 6.0 








22.5 


77.5 


2.3 


97.7 


60.1 ± 1.6 


14.1 


40.1 + 5.0 


(3) 


(J,H,K), B2,R2 


i 
















k 20.3 


79.7 


2.8 


97.2 


58.8 ± 1.4 


11.5 


41.6 + 5.6 


(4) 






' 56.9 


45.8 


2.9 


95.7 


77.1 ± 1.4 


40.7 


27.4 ± 3.2 


(J,H,K),B2,R2,Hk 


i 








74.9 ± 1.1 








54.2 


43.1 


4.3 


97.2 


37.6 


32.0 ± 2.3 








' 59.6 


40.4 


3.2 


96.8 


78.2 ± 1.4 


43.4 


27.8 ± 3.4 


(5) 


(J, H, K), (I, J, K) DENls , B2, R2, H K 


i 


^56.4 


43.6 


4.8 


95.2 


75.8 ± 1.2 


39.4 


32.0 ± 2.5 


(6) 


(J,H,K),H K 




' 58.5 


41.5 


3.1 


96.9 


77.7 ± 1.2 


42.7 


27.1 + 2.9 


i 








75.4 ± 1.0 






55.6 


44.4 


4.7 


95.3 


37.3 


31.2 + 2.5 



Note. — For each experiment, the outcomes for case 1 are given in the top line, for case 2 in the bottom line, x denotes 
the feature vector. TP, FN, FP and TN are defined in Table [2] Classification accuracy is defined as the fraction of correctly 
classified objects, i.e., T p + -|-^|,p + ] JK ■ The columns compo.7 and conto.7 denote the completeness and contamination of the dwarf 
output sample when using a threshold probability P t = 0.7. All values are given as the means obtained from 100 resamplings 
of training and test sets. For the accuracy and contamination we also quote the standard deviations. 
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Fig. 5. — Upper panels: Completeness and contamination of the dwarf output sample against 
P, for experiments 1, 3 and 6. Lower panels: Posterior probability of objects in the labeled test 
set for belonging to class "dwarf" given the feature vector x vs. the logg estimate for the same 
experiments. Components of x can be looked up in Table |3] 



the stellar isochrones from which Ib+IG derived their distances are systematically offset from the 



location of real giants. Although we have stated that the early type " dwarfs" contain an unknown 



fraction of subgiants and turnoff stars, we are confident that the iB+lOi distances are reliable enough 
for these stars. In fact, these authors suggested being careful when using their distances for stars 
with log g < 3 only, which is well below the dwarf-giant threshold in our early-type subsample 
(at logg = 3.32). There are, however, subgiants from later types with logg < 3 (Table [D, which 
justifies their exclusion from our sample. 



We used the TOPCAT Tool written by Mark Taylor to cross-match the Breddels et al. 



catalog with our combined dwarf sample. The resulting sample includes 9696 stars. For our 
kinematic analysis, we further restricted the sample to stars with distances less than 500 pc, rel- 



http ://www. star.bris . ac . uk/~mbt/topcat/ 
13 available from http://www.astro.rug.nl/~rave/ 
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ative distance errors less than 40%, proper motion uncertainties less than 5 mas (in both compo- 
nents) and radial velocity uncertainties less than 5 km s _1 . This leaves 3 508 stars among which 



we searched for stellar streams in the same way as described in KFR08. That is, we search for 
overdensities in the space spanned by (V,(U 2 + 2V 2 ) 1/2 ), which are measures for the angular mo- 
mentum and eccentricity of disk-like orbits passing the solar neighborhood. We applied a wavelet 
transform with scale and elongation para meters a = 7 km s -1 and q = V3, respectively, to enhance 



the overdensities (for more details see KFR08). To calculate their significance, we compare them 
against a smooth model that we derive from Monte Carlo simulations. The smooth model is a com- 
posite of three Schwarzschild distributions to represent the thin disk, thick disk and halo. There 
exists, however, a range of parameters of the three Schwarzschild distributions in the literature; the 
same is true for the thick-to-thin disk and halo-to-thin disk normalizations. Therefore, we test the 
significance of the overdensities against various smooth models and tabulate the results in Table|4] 
In each case, we added additional velocity errors that we drew from a Gaussian distribution with 
the standard deviation set to the standard deviation of the velocity uncertainties in our sample (7.3 
and 5.7 km s _1 for U and V, respectively). For every model, we drew 500 Monte Carlo samples and 
for each individual sample computed the wavelet transform with the same parameters that we used 
for the data. The significance is derived based on the Z-statistic, Z = (wy - vv^ c )/cr (i , where 
denotes the values of the wavelet transform for the data, vv^ c the mean obtained from the Monte 
Carlo samples and <r i; the standard deviation, in each case for bin (i, j). 

We demonstrate this principle of deriving significance levels for all overdensities on the basis 
of model 1 and Figure [6j The upper left panel displays the distribution of our sample stars in 
(V,(U 2 + 2V 2 ) 1/2 ) after applying the wavelet transform in order to enhance overdense regions. We 
can see hints of several well-known overdensities: Hercules at (V,(U 2 + 2V 2 ) 1/2 ) » (-50, 85) km 



s t he Hyades-Pleiades streams at (-20, 40) km s 1 and Sirius at (+4,9) km s 1 (IFamaey et al. 



2005b or the 'AF06' and Arcturus streams, which occupy th e region V « -80 km s 1 « -110 



km s \ respectively (Eg gen 1996 : Arifyanto & Fuchs 2006). In addition, there is an overdensity 



of 5 stars in the region of the KFR08 stream at -160 km s _1 < V < -140 km s _1 . The upper 
right and lower left panels show the mean value and standard deviation of the wavelet transform 
of the 500 Monte Carlo samples, respectively. In contrast to a single Monte Carlo sample, the 
mean value is a very smooth distribution, against which we compare our sample. By dividing the 
difference between the upper left and upper right panels through the lower left panel, we obtain 
the significance map showing Z, the standardized departure from the smooth model (lower right 
panel) Note that only features with Z > 2 are plotted. Z measures the likelihood of our data 



14 Like in KFR08, we set the standard deviation equal to one in bins where it is less than one. Also, we set the value 
of the wavelet transform for our sample and the mean of the Monte Carlo samples to zero whenever it is less than zero, 
because we are only interested in overdense regions of (V,(U 2 + 2V 2 ) 1/2 )-space. 
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Table 4. Parameters and fractions of the Schwarzschild distributions 





o-u 


cry 




/ 


Zkfros 


p ■ 

1 mm 




(kms- 1 ) 


(km s^ 1 ) 


(kms^ 1 ) 


(%) 






model 1 


(30,60,150) 


(19,40,95) 


(-10,-40,-220) 


(14,0.5) 


2.2 


0.09 


model 2 


(30,60,150) 


(19,40,95) 


(-10,-40,-220) 


(14,1.0) 


1.6 


0.22 


model 3 


(39,63,150) 


(20,39,95) 


(-12,-51,-220) 


(14,0.5) 


1.5 


0.24 


model 4 


(39,63,150) 


(20,39,95) 


(-12,-51,-220) 


(14,1.0) 


1.2 


0.33 


model 5 


(31,59,141) 


(18,48,92) 


(-10,-38,-200) 


(12,0.5) 


1.2 


0.33 


model 6 


(39,63,150) 


(20,39,95) 


(-12,-40,-220) 


(14,1.0) 


1.7 


0.20 



Note. — All velocities are given in the order thin disk, thick disk and halo, respectively. The normalizations 
/ are given in the order (thick disk, halo). Zkfros denotes the Z-score for the KFR08 stream, calculated as the 
mean of the two bins centered at (V az , Vae) = (— 143, 217) km s and (-141, 215) km s . P mm is the minimum 
posterior probability for the hypotheses that the KFR08 overdensity is compatible with the smooth model, obtained 
by calibrating the Z-score (i.e. p- value) such that an interpretation as minimum Bayes factor or minimum posterior 
probability is justified and assuming a prior of 0.5 for the smooth model (see e.g. lHeldl2010h . 

1 Chosen to gi ve a reasonable fi t to t he overall velocity distribution of our s ample. The normalizations agree with 
those given bv ljuric et al. I {20081) and lSoubiran, B ienavme & Siebertl |2003|) . 

2 As model 2, but fraction of halo stars increased to 1%. 

3,4 Parameters for thin and thick disk taken from Soubiran, Bienavme & Siebert j |2003|) . 

Parameters chosen as m e an v al ues from the litera t ure: Thin disk velocity ellipsoids from 
ISoubiran. Bienavme & Siebertl |2003|) ; lAllende Prieto et al. I {2004), thick disk velocity ellipso ids and 
normalizations from |Chiba & Beers (2000); Siegel et a ll 120021) ; ISoubiran. Bienavme & S iebert <2003l) ; 
lArifvanto et al. I l2005t); lju ric et al . I j2008l); ICarollo et al. I |2010h. halo velocity ellip soids and normalizations 
from lChiba & Beers I j2000l) ; |juric et al. I l2008l) ; ISmifh et al. I l2009l) ; lBond et aTl J2010l) . 

6 Chosen to investigate the effect of varying V^g for the thick disk. Compare to model 4. 
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assuming a smooth Milky Way model without kinematic substructure. 

First we note that we do not want to make any inferences about features at regions in the 
(V,(U 2 + 2V 2 ) 1/2 )-plane, where our smooth model predicts an underdensity of stars. These are the 
two features with V velocities between 30 and 70 km s _1 and the tiny spot at (V,(U 2 + 2V 2 ) 1/2 ) = 
(-70, 172) km s" 1 . Due to the fact that underdensities artificially increase the signal, we have only 
indicated the position of the stars associated with these overdensities with thick crosses in Figure[6l 
but do not make any further inferences about their possible association with stellar streams. 

From the overdensities that we consider real, the most outstanding ones in the significance 
map are the three well-known streams Hercules (Z = 6.5), Hyades-Pleiades (Z = 3.1) and Sirius 
(Z = 6.3). These are highly significant for all parameter combinations covered by the models 1-6 
and are believed to be dynamical str eams, resulting from the perturbations th at spiral arms and/or 



the bar induce in the local potential (lFamaey et al.ll2005l : lAntoja et al. 1 12009). At lower rotational 



velocities, neither the AF06 (Z = 0.8) nor the Arcturus (Z = 0.05) streams are significant enough 
to appear in the lower right panel of Figure [6] In fact, the overdensities attributed to these streams 
in the upper left panel are fully consistent with arising by chance in any of our smooth Milky Way 
models given in tabled 

Finally, we detect a small signal of the KFR08 feature in which we are mostly interested in 
this study. It is centered at (V,(U 2 + 2V 2 ) 1/2 ) = (-142, 116) km s" 1 in Figure |6l which departs 
with Z = 2.2 from the smooth model. In this region, the mean value and standard deviation of 
the wavelet transform of the 500 Monte Carlo samples ranges from 0.2-0.7 and from 2.0-2.3, 
respectively, implying no false amplification of the KFR08 signal. However, we found that the 
significance of the KFR08 feature given any other smooth Milky Way model besides model 1 is 
not high enough to give convincing evidence for its realness (see Table H]). Besides the choice 
of the halo normalization, the rotational lag of the thick disk population seems to have a major 
impact on the statistical significance of the KFR08 feature (compare models 4 and 6). From all 
six models, we infer a mean significance level of Z = 1.6 + 0.4, corresponding to a (one-sided) p- 
value of 0.05^° Q3. The /?-value is the probability for obtaining a Z- value equal to or more extreme 
than the observed one assuming that the KFR08 overdensity is consistent with a smooth model 
(the null hypothesis H ). However, we wish to infer P(H Q \D), the probability of this hypothesis 
given our data. We obtain this posterior probability using Bayes theorem; this requires fixing a 
prior probability P(H ) for H and computing the likelihood for a second hypothesis (Hi), e.g. a 
particular model that explains the observed overdensity: 



P(D\H ) ■ P(H ) + P(D\H X ) • (1 - P(H )) 

P(D\H ) P(H Q ) , r i ^ ; 



" I l P(£>i#i) ' i-P(H y I 
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Here, P(D\H ) and P{D\H\) are the likelihoods for the null and alternative hypothesis, respectively, 
and the prior of the alternative has been expressed through the prior of H (as we assume that 
Hq and Hi are mutually exclusive and complete). The ratio is also known as the Bayes 

factor; smaller values of the Bayes factor provide stronger evidence against Hq. Given the marginal 
significance of our detection, we do not consider it worth comparing different alternative models, 
but only assume that the alternative hypothesis is the one which maximizes the likelihood of our 
data and therefore minimizes the Bayes factor. In this way, we at least estimate the minimum 
posterior probability for the null hypothesis that the KFR08 overdensity is consistent with a smooth 
model. Assuming that the alternative hypothesis is the one which maximizes the likelihood of 
the data implies that its test statistic, Z\, is distributed such that its mean is at the observed Z- 
value. If we fu rther assume a normal distribution for Z], the minimum Bayes factor is given 



by exp(-0.5Z 2 ) ( Edwards. Lindman & Savage II 1963c iHeldlboiOr) . We give the null hypothesis a 



50% prior probability (i.e. we a priori give equal chance to this or any other hypothesis). From 
eqation ©, the minimum posterior probability for the smooth-model hypotheses then comes out 
to be in the range 0.09-0.33. These minimum probabilities for each model are tabulated in Table HI 
Other conversions between the test statistic (or p-value) and minimum Bayes factors are possible 
and usually result in somewhat larger values for the minimum posterior of the null hypothesis 



(Held 1 1201 Oh . We therefore conclude that although the simple smooth halo models tested here 
seem unlikely based on their Z-values, their actual probabilities do not allow us to rule them out as 
the cause for the observed KFR08 overdensity. 

There are 5 stars that we associate with the KFR08 signal. We have marked their position in 
the significance map of Figure [6] with black crosses and give their velocities, distances and astro- 
physical parameters in table [5] A Kolmogorov-Smirnov test of their calibrated metal abundances^ 
against those of all other stars in the range -220 km s^ 1 < V < -100 km s _1 (N = 20) gives 
a probability of 0.81 that both distributions differ. However, this too is not a significant result, 
and a larger number of stars would certainly help drawing definite conclusions about an eventual 
chemical homogeneity of putative KFR08 stream members. We note that eviden ce for the exis 



tence of the KFR08 stream has already been found in two independent datasets: Klement et al 



(|2009|) . analyzing data from the SDSS DR7, described two features at the approximate position 
of the 'RAVE' stream, of which one ('Rl') had systematically lower [Fe/H] values than the other 
('R2'). The [Fe/H] values of the 'Rl' strea m are thereby consistent with photo metric [Fe/H] esti- 



mates for KFR08 candidate stars derived by lBobylev. Bajkova & Myll ari (20 10J). Based on revised 



versions of the Hipparcos and Geneva-Copenhagen survey catalogs. Bobylev. Bajkova & Myllari 



identified 19 putative stream members of the KFR08 stream, comprised of dwarfs, turnoff stars 



15 The typical uncertainties for the KFR08 stars are 0.2 dex ( Zwitter et al. 20081) . but do not get considered in the 
K-S test. 
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and giants with relative uncertainties on the parallax of less than 15% and characterized through 
«V>,ov) = (-151,11) km s" 1 and «[Fe/H]), cr [Fe/H] ) = (-0.7,0.3). A comparison with the V 
velocities of the stars we associate with the KFR08 stream (((V), cr v ) = (-146, 5) km s™ 1 , see Ta- 
ble HI) gives a fairly good agreement, altough we note that two stars have very large uncertainties 
in their V velocity components. By fitti ng the Y 2 isochrones to the color-magnitude diagram of 
their stars. iBobylev. Bajkova & Myllari I derived an age of 13 Gyr for the KFR08 stream. They ar- 
gued that "the homogeneity of the kinematics, chemical composition, and age of the sample stars 
is consistent with the hypothesis that the stream is a relic remnant of the galaxy captured and dis- 
rupted by the tidal effect of our own Galaxy", but also hypothesized about a possible connection 
to the Arcturus stream, which would need greater statistics to decide about. Our sample suffers 
from small statistics, too, but if we assume the KFR08 stream to exist and the five stars tabulated in 
Table H] to be its members, then the high vertical velocities for four of these stars (\W\ > 50 km/s) 
more likely support the idea that the KPR08 stream i s comprised of halo s tars rather than having a 
dynamical origin in the (thick) disk (as suggested by LMinchev et al. Il2009|) . 



5. Conclusions 

With the release of the second RAVE data release, which contains logg values for 14,772 
individual stars, we were able to create a set of stars with probabilities for belonging to the two 
classes "dwarf" and "giant" by fitting the logg distributions with multiple component mixture 
models. The estimated high fraction of giants calls for a critical re-analysis of our recent study 



(KFR08). For this purpose we tested whether we could use the stars to which we had assigned 
class labels to classify the ones without logg estimates in order to enlarge the dwarf sample. We 
restricted ourselves to the color range 0.5 < J - K < 0.9, because there the association between 
one component of the mixture model and the dwarfs was clearest. Using a SVM algorithm, we 
investigated which features would be suitable for that task by splitting our labeled sample into a 
training and test set and evaluating the classifier performance for different feature vectors. These 
comprised various combinations of photometr ic bands and reduced proper motions. As already 



suggested by Figure |4] and arguments made by Majewski et al. I (|2003k the classifier was not able 



to draw a meaningful boundary between dwarfs and giants from infrared magnitudes alone; the 
SVM classified nearly all the stars as giants in order to achieve the highest possible accuracy 
(~ 50%). Adding the optical B2 and R 2 magnitude s impr oved the classification. It is difficult 



to compare our result to the findings of iBilir et al. I (120061) . who separated dwarfs from giants 



based on their apparent V and (/, H, K) magnitudes alone: First, these authors do not give any 
performance measure of their classification. From a close inspection of their Fig. 1, it seems that 
the giant contamination in the dwarf sample is very small, but it should also be noted that their 
samples lack any intermediate stars with 3 < logg < 4 that might add further confusion. Second, 
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B2 a nd R2 are derived fr om photographic plates and posess relatively large uncertainties (~ 0.25 



mag, 



Monet et al. Il2003|) . Third, the V ba nd is locate d between B2 and R2, so that our results can 

Our best results in terms of accuracy and 



not be directly compared to the work of Bil ir et al 
contamination have been obtained when we used reduced proper motions in the feature vector. 
However, even with a classification accuracy around 75%, the contamination in the output dwarf 
sample was higher than 25% (Table [3]). We therefore conclude that photometric data for RAVE 
stars together with reduced proper motions alone are not adequate when the goal is to build a 
sample of dwarfs with minimal contamination from giants. 



Being confident about the classifications we obtained from the mixture models in § I2.2[ we 
decided to take the labeled dwarfs from bo th the early and late-type subsamples and combine 
them with the distance estimates given in the |B+10j catalog. These stars posses high enough logg 
parameters to have reliable distance estimates. Finally, we only kept stars with low uncertainties 
on the proper motion and radial velocity (<r Ma < 5 mas, <r Ms < 5 mas, <x Vrad < 5 km s _1 ) and relative 
uncertainties on the distance less than 40%. 

In the res ulting s ample of 3508 stars, we searched for nearby stellar streams in the same way 
as described in lKFROa . This requires looking for overdensities in the space spanned by (V,(U 2 + 
2V 2 ) 1/2 ), which are proxies for a nearby star's angular momentum and eccentricity. Due to the 
variablility of Galactic model parameters given in the literature, we repeated our analysis for five 
different choices of parameters (tabulated in Table©. We detected the well-known moving groups 
Sirius, Hyades-Pleiades and Hercules at high confidence levels for every choice of parameters. We 
also found hints for the existence of the AF06 and Arcturus streams in the data, but they turned 
out to be consistent with resulting by chance from a smooth distribution (Z < 1). We found that 
the KFR08 stream is only marginally significant and reaches our required level of 2.0 only for one 
particular model (model 1). The choice of the halo normalization as well as the rotational lag of the 
thick disk component seem to be critical to the inferred significance level. For five stars associated 
with the KFR08 feature, the metallicity distribution deviates from that of the other stars in the range 
-220 < V < -100 km s" 1 at 81% confidence. We point out that the number of stars in this range, 
which are used in the test, is small (in total /V = 25). We also note that the V-velocities of two 
of the putative KFR08 stars posess large uncertainties (Table [5]). We therefore can neither confirm 
nor discard the realness of the KFR08 stream with any degree of confidence. Ulimately, one has to 
search for its signature in larger and/or indepe ndent data sets. Recently, evidence for its existence 



has b een found in two independent datasets (IKlement et al. 1 12009c iBobylev. Bajkova & Myllari 



2010 ). Hopefully, stronger constraints on the realness of the 'RAVE' stream can be put forward by 
RAVE itself as future data releases become available. 
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Fig. 6. — Upper left panel: Contours of the wavelet transform of our sample's d istributio n in 
(V,((U 2 +2V 2 ) 1/2 )). The same bin width and skewed Mexican hat kernel function as in KFR08l have 
been used. Upper right panel: Mean value of the wavelet transform of 500 Monte Carlo samples 
drawn from a Schwarzschild distribution with the "model 1" parameters given in Table HI Lower 
left panel: Standard deviation of the wavelet transform of the 500 Monte Carlo samples. Lower 
right panel: Significance map for our sample, showing only contours where the overdensities are 
> 2 standard deviations above the smooth model mean. The thick crosses mark the position of stars 
from the KFR08 stream and those associated with the two features at V > 30 km s" 1 (see text). 
The color bars give the values of the wavelet transform or the significance levels, respectively, on 
a linear scale. 



Table 5. Velocities, distances and astrophysical parameters for the five stars associated with the 

KFR08 feature in this study. 



Object ID 


Target ID 


logg 


7*eff 


[M/H] 


J-K 


U 




V 


cry 


W 




d 








(dex) 


K 


(dex) 


(mag) 


(tar 1 ) 


(kmr 1 ) 


(kmr 1 ) 


(kmr 1 ) 


(kmr 1 ) 


(kmr 1 ) 


(pc) 


(pc) 


C0622232-573246 


J062223 .3-573246 


4.74 


6597 


-o.io 


0.34 


71 


22 


-141 


7 


-24 


1 1.6 


466 


142 


T8496_00634_l 


1032150.0-552442 


4.31 


5744 


-0.22 


0.52 


-8 


7 


-145 


8 


-135 


7 


119 


46 


T4900_01632_l 


1093415.8-054505 


4.46 


5919 


-0.03 


0.48 


-50 


6 


-145 


7 


90 


5 


21 1 


67 


T8023_00143_l 


J001548.9-450433 


4.27 


5847 


-0.13 


0.43 


89 


30 


-145 


57 


59 


20 


311 


118 


C0229387-130746 


J022938.7-130746 


4.71 


6916 


-0.82 


0.36 


-97 


8 


-155 


44 


-135 


7 


422 


123 



All param eters are taken from the catalog of 



Breddels et al. 



(I2010I) . [M/H] denotes the calibrated metallicity 



estimate (see also lZwitter et al. 1120081) . (U, V, W) are the Cartesian velocity coordinates with respect to the Local Stan- 
dard of Rest and (cru, cry, o~w) their corresponding uncertainties, d and cr^ denote the distance from the sun and its 
uncertainty estimate. 
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